Overview

Dataset info

Number of variables38
Number of observations3837
Missing cells478 (0.3%)
Duplicate rows0 (0.0%)
Total size in memory1.1 MiB
Average record size in memory304.0 B

Variables types

Numeric11
Categorical3
Boolean18
Date0
URL0
Text (Unique)1
Rejected5
Unsupported0

Warnings

budget has 497 (13.0%) zeros Zeros
overseas-gross has 460 (12.0%) missing values Missing
overseas-pct has 462 (12.0%) zeros Zeros
revenues is highly correlated with overseas-gross (ρ = 0.9717238163) Rejected
studio has a high cardinality: 203 distinct values Warning
title has a high cardinality: 3801 distinct values Warning
TV_Movie has constant value "0" Rejected
Unnamed_0_x is highly correlated with Unnamed_0 (ρ = 0.9940708728) Rejected
Unnamed_0_y is highly correlated with bo_year_rank (ρ = 1) Rejected
worldwide-gross is highly correlated with revenues (ρ = 0.9906464716) Rejected

Variables

Action
Boolean

Distinct count2
Unique (%)0.1%
Missing (%)0.0%
Missing (n)0
0
2801
1
1036
ValueCountFrequency (%) 
0 2801 73.0%
 
1 1036 27.0%
 

Adventure
Boolean

Distinct count2
Unique (%)0.1%
Missing (%)0.0%
Missing (n)0
0
3125
1
712
ValueCountFrequency (%) 
0 3125 81.4%
 
1 712 18.6%
 

Animation
Boolean

Distinct count2
Unique (%)0.1%
Missing (%)0.0%
Missing (n)0
0
3591
1
 
246
ValueCountFrequency (%) 
0 3591 93.6%
 
1 246 6.4%
 

bo_year_rank
Numeric

Distinct count398
Unique (%)10.4%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean106.3468856
Minimum1
Maximum443
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1
5-th percentile7
Q134
Median83
Q3155
95-th percentile294
Maximum443
Range442
Interquartile range121

Descriptive statistics

Standard deviation90.12067176
Coef of variation0.847421824
Kurtosis0.7948089266
Mean106.3468856
MAD71.73746538
Skewness1.119633974
Sum408053
Variance8121.735478
Memory size30.1 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[ 1. 1.5 34.5 92.5 141.5 180.5 235.5 309.5 391. 443. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2 31 0.8%
 
21 31 0.8%
 
4 31 0.8%
 
6 31 0.8%
 
12 31 0.8%
 
3 31 0.8%
 
5 31 0.8%
 
7 31 0.8%
 
8 30 0.8%
 
10 30 0.8%
 
Other values (388) 3529 92.0%
 

Minimum 5 values

ValueCountFrequency (%) 
1 30 0.8%
 
2 31 0.8%
 
3 31 0.8%
 
4 31 0.8%
 
5 31 0.8%
 

Maximum 5 values

ValueCountFrequency (%) 
443 1 < 0.1%
 
438 1 < 0.1%
 
436 1 < 0.1%
 
435 1 < 0.1%
 
433 1 < 0.1%
 

budget
Numeric

Distinct count383
Unique (%)10.0%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean39998995.12
Minimum0
Maximum500000000
Zeros (%)13.0%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q17400000
Median25000000
Q355000000
95-th percentile150000000
Maximum500000000
Range500000000
Interquartile range47600000

Descriptive statistics

Standard deviation47430268.89
Coef of variation1.185786511
Kurtosis7.013132631
Mean39998995.12
MAD33984846.5
Skewness2.192316187
Sum1.534761443e+11
Variance2.249630407e+15
Memory size30.1 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0.00000e+00 4.65000e+01 1.12500e+05 9.83843e+05 1.04468e+06 ... 1.81500e+08 1.97500e+08 2.03500e+08 2.65000e+08 5.00000e+08], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 497 13.0%
 
30000000 141 3.7%
 
20000000 136 3.5%
 
40000000 129 3.4%
 
25000000 120 3.1%
 
35000000 107 2.8%
 
50000000 106 2.8%
 
15000000 98 2.6%
 
60000000 92 2.4%
 
10000000 92 2.4%
 
Other values (373) 2319 60.4%
 

Minimum 5 values

ValueCountFrequency (%) 
0 497 13.0%
 
93 1 < 0.1%
 
4000 1 < 0.1%
 
7000 1 < 0.1%
 
8000 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
500000000 1 < 0.1%
 
380000000 1 < 0.1%
 
356000000 1 < 0.1%
 
300000000 2 0.1%
 
280000000 1 < 0.1%
 

Comedy
Boolean

Distinct count2
Unique (%)0.1%
Missing (%)0.0%
Missing (n)0
0
2413
1
1424
ValueCountFrequency (%) 
0 2413 62.9%
 
1 1424 37.1%
 

Crime
Boolean

Distinct count2
Unique (%)0.1%
Missing (%)0.0%
Missing (n)0
0
3259
1
 
578
ValueCountFrequency (%) 
0 3259 84.9%
 
1 578 15.1%
 

Documentary
Boolean

Distinct count2
Unique (%)0.1%
Missing (%)0.0%
Missing (n)0
0
3741
1
 
96
ValueCountFrequency (%) 
0 3741 97.5%
 
1 96 2.5%
 

domestic-gross
Numeric

Distinct count1746
Unique (%)45.5%
Missing (%)0.4%
Missing (n)15
Infinite (%)0.0%
Infinite (n)0
Mean55993982.31
Minimum400
Maximum936700000
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum400
5-th percentile159150
Q16700000
Median31600000
Q371950000
95-th percentile200795000
Maximum936700000
Range936699600
Interquartile range65250000

Descriptive statistics

Standard deviation77861124.8
Coef of variation1.390526653
Kurtosis18.69540107
Mean55993982.31
MAD50684872.85
Skewness3.390172791
Sum2.140090004e+11
Variance6.062354755e+15
Memory size30.1 KiB
Histogram
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
1100000 27 0.7%
 
1300000 19 0.5%
 
1200000 17 0.4%
 
1000000 15 0.4%
 
1400000 15 0.4%
 
2200000 14 0.4%
 
1600000 14 0.4%
 
1700000 14 0.4%
 
2300000 13 0.3%
 
3000000 13 0.3%
 
Other values (1735) 3661 95.4%
 
(Missing) 15 0.4%
 

Minimum 5 values

ValueCountFrequency (%) 
400 1 < 0.1%
 
1000 1 < 0.1%
 
1700 2 0.1%
 
1800 1 < 0.1%
 
3900 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
936700000 1 < 0.1%
 
858400000 1 < 0.1%
 
749800000 1 < 0.1%
 
700100000 1 < 0.1%
 
678800000 1 < 0.1%
 

domestic-pct
Numeric

Distinct count906
Unique (%)23.6%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean56.95994266
Minimum0
Maximum100
Zeros (%)0.5%
Mini histogram

Quantile statistics

Minimum0
5-th percentile11.98
Q137.6
Median53.4
Q377.3
95-th percentile100
Maximum100
Range100
Interquartile range39.7

Descriptive statistics

Standard deviation27.0933746
Coef of variation0.475656634
Kurtosis-0.8149988889
Mean56.95994266
MAD22.38286128
Skewness0.08226896555
Sum218555.3
Variance734.0509471
Memory size30.1 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0.000e+00 5.000e-02 5.500e-01 2.005e+01 2.895e+01 ... 6.045e+01 7.745e+01 9.655e+01 9.995e+01 1.000e+02], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
100 462 12.0%
 
0 18 0.5%
 
51.2 13 0.3%
 
55.8 12 0.3%
 
36.4 11 0.3%
 
45.2 11 0.3%
 
29.4 11 0.3%
 
45.7 11 0.3%
 
57.9 11 0.3%
 
43.5 11 0.3%
 
Other values (896) 3266 85.1%
 

Minimum 5 values

ValueCountFrequency (%) 
0 18 0.5%
 
0.1 6 0.2%
 
0.2 6 0.2%
 
0.3 3 0.1%
 
0.4 4 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
100 462 12.0%
 
99.9 7 0.2%
 
99.8 4 0.1%
 
99.7 2 0.1%
 
99.6 1 < 0.1%
 

Drama
Boolean

Distinct count2
Unique (%)0.1%
Missing (%)0.0%
Missing (n)0
0
2010
1
1827
ValueCountFrequency (%) 
0 2010 52.4%
 
1 1827 47.6%
 

Family
Boolean

Distinct count2
Unique (%)0.1%
Missing (%)0.0%
Missing (n)0
0
3374
1
 
463
ValueCountFrequency (%) 
0 3374 87.9%
 
1 463 12.1%
 

Fantasy
Boolean

Distinct count2
Unique (%)0.1%
Missing (%)0.0%
Missing (n)0
0
3434
1
 
403
ValueCountFrequency (%) 
0 3434 89.5%
 
1 403 10.5%
 

Film_Genre
Categorical

Distinct count19
Unique (%)0.5%
Missing (%)< 0.1%
Missing (n)1
Drama
895
Comedy
808
Action
686
Other values (15)
1447
ValueCountFrequency (%) 
Drama 895 23.3%
 
Comedy 808 21.1%
 
Action 686 17.9%
 
Adventure 275 7.2%
 
Horror 196 5.1%
 
Thriller 177 4.6%
 
Crime 166 4.3%
 
Animation 120 3.1%
 
Romance 104 2.7%
 
Fantasy 88 2.3%
 
Other values (8) 321 8.4%
 
Max length15
Mean length6.44357571
Min length3
Contains charsTrue
Contains digitsFalse
Contains spacesTrue
Contains non-wordsTrue

History
Boolean

Distinct count2
Unique (%)0.1%
Missing (%)0.0%
Missing (n)0
0
3676
1
 
161
ValueCountFrequency (%) 
0 3676 95.8%
 
1 161 4.2%
 

Horror
Boolean

Distinct count2
Unique (%)0.1%
Missing (%)0.0%
Missing (n)0
0
3477
1
 
360
ValueCountFrequency (%) 
0 3477 90.6%
 
1 360 9.4%
 

imdb_id
Categorical, Unique

First 5 values
tt0096734
tt0096754
tt0096794
tt0096874
tt0096895
Last 5 values
tt8385474
tt8663516
tt8695030
tt8772262
tt9541602

First 5 values

ValueCountFrequency (%) 
tt0096734 1 < 0.1%
 
tt0096754 1 < 0.1%
 
tt0096794 1 < 0.1%
 
tt0096874 1 < 0.1%
 
tt0096895 1 < 0.1%
 

Last 5 values

ValueCountFrequency (%) 
tt9541602 1 < 0.1%
 
tt8772262 1 < 0.1%
 
tt8695030 1 < 0.1%
 
tt8663516 1 < 0.1%
 
tt8385474 1 < 0.1%
 

Music
Boolean

Distinct count2
Unique (%)0.1%
Missing (%)0.0%
Missing (n)0
0
3718
1
 
119
ValueCountFrequency (%) 
0 3718 96.9%
 
1 119 3.1%
 

Mystery
Boolean

Distinct count2
Unique (%)0.1%
Missing (%)0.0%
Missing (n)0
0
3496
1
 
341
ValueCountFrequency (%) 
0 3496 91.1%
 
1 341 8.9%
 

overseas-gross
Numeric

Distinct count1703
Unique (%)44.4%
Missing (%)12.0%
Missing (n)460
Infinite (%)0.0%
Infinite (n)0
Mean83057641.13
Minimum100
Maximum2029200000
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum100
5-th percentile390800
Q18000000
Median32800000
Q391700000
95-th percentile349600000
Maximum2029200000
Range2029199900
Interquartile range83700000

Descriptive statistics

Standard deviation143810476.9
Coef of variation1.731453903
Kurtosis31.71050294
Mean83057641.13
MAD85492413.14
Skewness4.381861616
Sum2.804856541e+11
Variance2.068145327e+16
Memory size30.1 KiB
Histogram
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
1100000 15 0.4%
 
1200000 15 0.4%
 
1900000 14 0.4%
 
3700000 14 0.4%
 
1300000 13 0.3%
 
2800000 12 0.3%
 
2200000 12 0.3%
 
1400000 12 0.3%
 
4200000 11 0.3%
 
5400000 11 0.3%
 
Other values (1692) 3248 84.6%
 
(Missing) 460 12.0%
 

Minimum 5 values

ValueCountFrequency (%) 
100 1 < 0.1%
 
900 1 < 0.1%
 
1700 1 < 0.1%
 
4500 1 < 0.1%
 
5300 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
2029200000 1 < 0.1%
 
1937900000 1 < 0.1%
 
1528100000 1 < 0.1%
 
1369500000 1 < 0.1%
 
1163000000 1 < 0.1%
 

overseas-pct
Numeric

Distinct count906
Unique (%)23.6%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean43.04005734
Minimum0
Maximum100
Zeros (%)12.0%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q122.7
Median46.6
Q362.4
95-th percentile88.02
Maximum100
Range100
Interquartile range39.7

Descriptive statistics

Standard deviation27.0933746
Coef of variation0.6294920656
Kurtosis-0.8149988889
Mean43.04005734
MAD22.38286128
Skewness-0.08226896555
Sum165144.7
Variance734.0509471
Memory size30.1 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0.000e+00 5.000e-02 3.450e+00 2.255e+01 3.955e+01 ... 7.105e+01 7.995e+01 9.945e+01 9.995e+01 1.000e+02], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 462 12.0%
 
100 18 0.5%
 
48.8 13 0.3%
 
44.2 12 0.3%
 
55.5 11 0.3%
 
56.5 11 0.3%
 
63.6 11 0.3%
 
70.6 11 0.3%
 
54.3 11 0.3%
 
54 11 0.3%
 
Other values (896) 3266 85.1%
 

Minimum 5 values

ValueCountFrequency (%) 
0 462 12.0%
 
0.1 7 0.2%
 
0.2 4 0.1%
 
0.3 2 0.1%
 
0.4 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
100 18 0.5%
 
99.9 6 0.2%
 
99.8 6 0.2%
 
99.7 3 0.1%
 
99.6 4 0.1%
 

popularity
Numeric

Distinct count3496
Unique (%)91.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean13.62128303
Minimum0.6
Maximum452.653
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum0.6
5-th percentile3.7568
Q17.938
Median11.515
Q316.141
95-th percentile28.891
Maximum452.653
Range452.053
Interquartile range8.203

Descriptive statistics

Standard deviation13.32344393
Coef of variation0.9781342842
Kurtosis386.9207152
Mean13.62128303
MAD6.272453193
Skewness14.7621721
Sum52264.863
Variance177.5141582
Memory size30.1 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[ 0.6 0.6085 1.7825 4.934 5.9875 ... 30.814 39.9485 48.0325 148.335 452.653 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.6 10 0.3%
 
1.4 4 0.1%
 
9.567 3 0.1%
 
15.907 3 0.1%
 
10.869 3 0.1%
 
13.92 3 0.1%
 
13.506 3 0.1%
 
11.923 3 0.1%
 
10.472 3 0.1%
 
12.067 3 0.1%
 
Other values (3486) 3799 99.0%
 

Minimum 5 values

ValueCountFrequency (%) 
0.6 10 0.3%
 
0.617 1 < 0.1%
 
0.679 1 < 0.1%
 
0.745 1 < 0.1%
 
0.746 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
452.653 1 < 0.1%
 
270.012 1 < 0.1%
 
262.515 1 < 0.1%
 
151.174 1 < 0.1%
 
145.496 1 < 0.1%
 

revenues
Highly correlated

This variable is highly correlated with overseas-gross and should be ignored for analysis

Correlation0.9717238163

Romance
Boolean

Distinct count2
Unique (%)0.1%
Missing (%)0.0%
Missing (n)0
0
3098
1
739
ValueCountFrequency (%) 
0 3098 80.7%
 
1 739 19.3%
 

runtime
Numeric

Distinct count131
Unique (%)3.4%
Missing (%)< 0.1%
Missing (n)1
Infinite (%)0.0%
Infinite (n)0
Mean110.0625652
Minimum27
Maximum338
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum27
5-th percentile86
Q196
Median106
Q3121
95-th percentile147
Maximum338
Range311
Interquartile range25

Descriptive statistics

Standard deviation20.25014999
Coef of variation0.1839876252
Kurtosis5.863595766
Mean110.0625652
MAD15.41567891
Skewness1.315762642
Sum422200
Variance410.0685748
Memory size30.1 KiB
Histogram
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
100 119 3.1%
 
105 102 2.7%
 
93 97 2.5%
 
97 96 2.5%
 
96 96 2.5%
 
101 95 2.5%
 
95 92 2.4%
 
106 90 2.3%
 
98 87 2.3%
 
91 87 2.3%
 
Other values (120) 2875 74.9%
 

Minimum 5 values

ValueCountFrequency (%) 
27 1 < 0.1%
 
37 1 < 0.1%
 
38 1 < 0.1%
 
39 2 0.1%
 
41 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
338 1 < 0.1%
 
216 1 < 0.1%
 
214 1 < 0.1%
 
213 1 < 0.1%
 
201 1 < 0.1%
 

Science_Fiction
Boolean

Distinct count2
Unique (%)0.1%
Missing (%)0.0%
Missing (n)0
0
3415
1
 
422
ValueCountFrequency (%) 
0 3415 89.0%
 
1 422 11.0%
 

studio
Categorical

Distinct count203
Unique (%)5.3%
Missing (%)< 0.1%
Missing (n)1
Uni.
 
371
WB
 
356
Fox
 
339
Other values (199)
2770
ValueCountFrequency (%) 
Uni. 371 9.7%
 
WB 356 9.3%
 
Fox 339 8.8%
 
BV 279 7.3%
 
Sony 255 6.6%
 
Par. 230 6.0%
 
LGF 105 2.7%
 
NL 103 2.7%
 
FoxS 103 2.7%
 
Focus 90 2.3%
 
Other values (192) 1605 41.8%
 
Max length11
Mean length3.478759447
Min length2
Contains charsTrue
Contains digitsTrue
Contains spacesTrue
Contains non-wordsTrue

Thriller
Boolean

Distinct count2
Unique (%)0.1%
Missing (%)0.0%
Missing (n)0
0
2792
1
1045
ValueCountFrequency (%) 
0 2792 72.8%
 
1 1045 27.2%
 

title
Categorical

Distinct count3801
Unique (%)99.1%
Missing (%)0.0%
Missing (n)0
Unknown
 
2
Life
 
2
Raavan
 
2
Other values (3798)
3831
ValueCountFrequency (%) 
Unknown 2 0.1%
 
Life 2 0.1%
 
Raavan 2 0.1%
 
Fantastic Four 2 0.1%
 
Kabali 2 0.1%
 
The Lion King 2 0.1%
 
Point Break 2 0.1%
 
Aladdin 2 0.1%
 
The Mummy 2 0.1%
 
Frozen 2 0.1%
 
Other values (3791) 3817 99.5%
 
Max length82
Mean length14.91842585
Min length1
Contains charsTrue
Contains digitsTrue
Contains spacesTrue
Contains non-wordsTrue

TV_Movie
Constant

This variable is constant and should be ignored for analysis

Constant value0

Unnamed_0
Numeric

Distinct count3837
Unique (%)100.0%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean1918
Minimum0
Maximum3836
Zeros (%)< 0.1%
Mini histogram

Quantile statistics

Minimum0
5-th percentile191.8
Q1959
Median1918
Q32877
95-th percentile3644.2
Maximum3836
Range3836
Interquartile range1918

Descriptive statistics

Standard deviation1107.79082
Coef of variation0.5775760269
Kurtosis-1.2
Mean1918
MAD959.2499348
Skewness0
Sum7359366
Variance1227200.5
Memory size30.1 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[ 0. 3836.], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2047 1 < 0.1%
 
2676 1 < 0.1%
 
2700 1 < 0.1%
 
649 1 < 0.1%
 
2696 1 < 0.1%
 
645 1 < 0.1%
 
2692 1 < 0.1%
 
641 1 < 0.1%
 
2688 1 < 0.1%
 
637 1 < 0.1%
 
Other values (3827) 3827 99.7%
 

Minimum 5 values

ValueCountFrequency (%) 
0 1 < 0.1%
 
1 1 < 0.1%
 
2 1 < 0.1%
 
3 1 < 0.1%
 
4 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
3836 1 < 0.1%
 
3835 1 < 0.1%
 
3834 1 < 0.1%
 
3833 1 < 0.1%
 
3832 1 < 0.1%
 

Unnamed_0_x
Highly correlated

This variable is highly correlated with Unnamed_0 and should be ignored for analysis

Correlation0.9940708728

Unnamed_0_y
Highly correlated

This variable is highly correlated with bo_year_rank and should be ignored for analysis

Correlation1

vote_average
Numeric

Distinct count60
Unique (%)1.6%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean6.37211363
Minimum0
Maximum10
Zeros (%)0.1%
Mini histogram

Quantile statistics

Minimum0
5-th percentile5
Q15.9
Median6.4
Q37
95-th percentile7.7
Maximum10
Range10
Interquartile range1.1

Descriptive statistics

Standard deviation0.8407405492
Coef of variation0.1319406084
Kurtosis2.08536162
Mean6.37211363
MAD0.6569979329
Skewness-0.4840546078
Sum24449.8
Variance0.7068446711
Memory size30.1 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[ 0. 2.95 4.05 4.35 4.85 ... 7.45 7.65 7.95 8.45 10. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
6.2 213 5.6%
 
6.3 190 5.0%
 
6.1 188 4.9%
 
6.6 183 4.8%
 
6.4 183 4.8%
 
6 169 4.4%
 
5.9 169 4.4%
 
6.5 167 4.4%
 
6.8 163 4.2%
 
6.7 161 4.2%
 
Other values (50) 2051 53.5%
 

Minimum 5 values

ValueCountFrequency (%) 
0 2 0.1%
 
2.5 2 0.1%
 
2.7 1 < 0.1%
 
2.9 1 < 0.1%
 
3 2 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
10 1 < 0.1%
 
8.6 1 < 0.1%
 
8.5 2 0.1%
 
8.4 9 0.2%
 
8.3 14 0.4%
 

War
Boolean

Distinct count2
Unique (%)0.1%
Missing (%)0.0%
Missing (n)0
0
3729
1
 
108
ValueCountFrequency (%) 
0 3729 97.2%
 
1 108 2.8%
 

Western
Boolean

Distinct count2
Unique (%)0.1%
Missing (%)0.0%
Missing (n)0
0
3799
1
 
38
ValueCountFrequency (%) 
0 3799 99.0%
 
1 38 1.0%
 

worldwide-gross
Highly correlated

This variable is highly correlated with revenues and should be ignored for analysis

Correlation0.9906464716

year
Numeric

Distinct count31
Unique (%)0.8%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean2008.022935
Minimum1989
Maximum2019
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1989
5-th percentile1995
Q12003
Median2009
Q32014
95-th percentile2018
Maximum2019
Range30
Interquartile range11

Descriptive statistics

Standard deviation6.992025507
Coef of variation0.003482044645
Kurtosis-0.3584544594
Mean2008.022935
MAD5.761877971
Skewness-0.5325710856
Sum7704784
Variance48.88842069
Memory size30.1 KiB
Histogram
Histogram with fixed size bins (bins=31)
Histogram
Histogram with variable size bins (bins=[1989. 1989.5 1994.5 1998.5 2000.5 2005.5 2017.5 2019. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2016 221 5.8%
 
2015 210 5.5%
 
2011 205 5.3%
 
2008 197 5.1%
 
2006 197 5.1%
 
2014 196 5.1%
 
2013 190 5.0%
 
2012 181 4.7%
 
2017 180 4.7%
 
2009 179 4.7%
 
Other values (21) 1881 49.0%
 

Minimum 5 values

ValueCountFrequency (%) 
1989 29 0.8%
 
1990 26 0.7%
 
1991 23 0.6%
 
1992 32 0.8%
 
1993 30 0.8%
 

Maximum 5 values

ValueCountFrequency (%) 
2019 67 1.7%
 
2018 151 3.9%
 
2017 180 4.7%
 
2016 221 5.8%
 
2015 210 5.5%
 

Correlations

Missing values

Sample

First rows

ActionAdventureAnimationbo_year_rankbudgetComedyCrimeDocumentarydomestic-grossdomestic-pctDramaFamilyFantasyFilm_GenreHistoryHorrorimdb_idMusicMysteryoverseas-grossoverseas-pctpopularityrevenuesRomanceruntimeScience_FictionstudioThrillertitleTV_MovieUnnamed_0Unnamed_0_xUnnamed_0_yvote_averageWarWesternworldwide-grossyear
0001294000000000339700000.037.8010Animation00tt026654300559500000.062.234.1789403355360100.00BV0Finding Nemo00417.800899200000.02003.0
1000255000000100329700000.048.7100Comedy00tt010983000347700000.051.337.2046779453991142.00Par.0Forrest Gump01518.400677400000.01994.0
2000915000000000130100000.036.5100Drama00tt016954700226200000.063.524.5043562966010122.00DW0American Beauty02688.000356300000.01999.0
300088128000000104200000.010.5100Drama00tt01686291035800000.089.513.044400318790141.00FL0Dancer in the Dark038877.90040000000.02000.0
411099000000000063800000.024.2001Adventure00tt011911600200100000.075.834.9692639201800126.01Sony1The Fifth Element04987.400263900000.01997.0
50001690000401000.04.1100Drama00tt0314412009300000.095.95.30097269541106.00SPC0My Life Without Me05111686.3009700000.02003.0
61104140000000000305400000.046.7001Adventure00tt032598000348900000.053.341.6796550112240143.00BV0Pirates of the Caribbean The Curse of the Blac...061237.700654300000.02003.0
7100263000000001070100000.038.7000Action00tt026669700110900000.061.325.9521809490450111.00Mira.0Kill Bill Vol. 10713257.900180900000.02003.0
8000567200000000062700000.064.7100Drama00tt04187630034200000.035.314.157968899980123.00Uni.0Jarhead0814556.51096900000.02005.0
90001314000000000101200000.063.6000Western00tt01056950058000000.036.421.7591591574470131.00WB0Unforgiven0918127.901159200000.01992.0

Last rows

ActionAdventureAnimationbo_year_rankbudgetComedyCrimeDocumentarydomestic-grossdomestic-pctDramaFamilyFantasyFilm_GenreHistoryHorrorimdb_idMusicMysteryoverseas-grossoverseas-pctpopularityrevenuesRomanceruntimeScience_FictionstudioThrillertitleTV_MovieUnnamed_0Unnamed_0_xUnnamed_0_yvote_averageWarWesternworldwide-grossyear
38270002275000000100120600000.030.5000Comedy00tt691160800274400000.069.512.9081672255251113.00Uni.0Mamma Mia Here We Go Again038279686217.200395000000.02018.0
3828000327108936000053200.043.6100Drama00tt13370510068800.056.41.809482980115.00IFC0Police Adjective0382896943266.600122000.02009.0
3829000147000012600000.054.0100Drama00tt07963070010700000.046.05.687233113910106.00Wein.0Under the Same Moon0382997041467.30023300000.02008.0
3830100912500000000035400000.065.8000Action00tt68508200018400000.034.216.008488187230102.00STX1Peppermint038309705906.50053800000.02018.0
3831010329500000000088800000.039.2011Adventure00tt081425500137700000.060.825.4472264972090119.00Fox0Percy Jackson The Olympians The Lightning Thief038319710316.100226500000.02010.0
3832000534800000010067400000.055.9110Comedy00tt74015880053200000.044.118.839147000000118.00Par.0Instant Family038329711527.600120600000.02018.0
38330001041680000010010500000.020.7100Comedy00tt28707560040500000.079.39.57551029361197.00SPC0Magic in the Moonlight0383397151036.50051000000.02014.0
383400057000035400000.096.8100Thriller01tt6722030011200000.03.217.811257243050102.00SGem1The Intruder038349717566.20036600000.02019.0
38350002020000000000175000000.068.6000Thriller01tt68571120080100000.031.437.4712546644600116.00Uni.1Us038359720197.000255100000.02019.0
383600018301002100000.056.1100Comedy00tt5884230001700000.043.97.57424000000101.00Annapurna0Brads Status0383697241826.1003800000.02017.0